C2H: A Computational Model of H&H-based Phonetic Contrast in Synthetic Speech
نویسندگان
چکیده
This paper presents a computational model of human speech production based on the hypothesis that low energy attractors for a human speech production system can be identified, and that interpolation/extrapolation along the key dimension of hypo/hyper-articulation can be motivated by energetic considerations of phonetic contrast. An HMM-based speech synthesiser along with continuous adaptation of its statistical models was used to implement the model. Two adaptation methods were proposed for vowel and consonant models and their effectiveness was tested by showing that such hypo/hyper-articulation control can manipulate successfully the intelligibility of synthetic speech in noise. Objective evaluations with the ANSI Speech Intelligibility Index indicate that intelligibility in various types of noise is effectively controlled. In particular, in the hyper-articulation transforms, the improvement with respect to unadapted speech is above 25 %.
منابع مشابه
Fuzzy linear regression model with crisp coefficients: A goal programming approach
The fuzzy linear regression model with fuzzy input-output data andcrisp coefficients is studied in this paper. A linear programmingmodel based on goal programming is proposed to calculate theregression coefficients. In contrast with most of the previous works, theproposed model takes into account the centers of fuzzy data as animportant feature as well as their spreads in the procedure ofconstr...
متن کاملArticulatory control of HMM-based parametric speech synthesis driven by phonetic knowledge
This paper presents a method to control the characteristics of synthetic speech flexibly by integrating articulatory features into a Hidden Markov Model (HMM)-based parametric speech synthesis system. In contrast to model adaptation and interpolation approaches for speaking style control, this method is driven by phonetic knowledge, and target speech samples are not required. The joint distribu...
متن کاملEnumeration of Conformers of Octahedral [M(ABC)_{6}] Complex on the Basis of Computational Group Theory
Conformers of [M(ABC)6] complex have been enumerated on the basis of computational group theory, where M is the central metal, and ABC is the ligand, bound to M through A. Based on the 16 conformers of the M(AB)6 core unit, 7173 conformers have been found for the [M(ABC)6] complex, which are assigned to nine point groups, 1 D3d, 4 D3, 4 S6, 5 C2h, 7 C3, 182 C2, 15 Cs, 23 Ci, and 6932 C1.
متن کاملمقایسه روش های طیفی برای شناسایی زبان گفتاری
Identifying spoken language automatically is to identify a language from the speech signal. Language identification systems can be divided into two categories, spectral-based methods and phonetic-based methods. In the former, short-time characteristics of speech spectrum are extracted as a multi-dimensional vector. The statistical model of these features is then obtained for each language. The ...
متن کاملSpeech enhancement based on hidden Markov model using sparse code shrinkage
This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012